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Abstract 

We present a new approach to construction of protocols which are proof 
against communication errors. The construction is based on a general- 
ization of the well known Ulam's game. We show equivalence between 
winning strategies in this game and robust protocols for multi-party com- 
putation. We do not give any complete theory. We want rather to describe 
a new fresh idea. We use a tree code defined by Schulman. The tree code 
is the most important part of the interactive version of Shannon's Coding 
Theorem proved by Schulman. He uses probabilistic argument for the 
existence of a tree code without giving any effective construction. We 
show another proof yielding a randomized construction which in contrary 
to his proof almost surely gives a good code. Moreover our construction 
uses much smaller alphabet. 



1 Introduction 

We consider a generalization of the communication model introduced by Yao 
9 . Processors P%, P2, . . . , Pfc compute a fc-argument function j:X\ x X2 x 
... x Xk — > Y. Initially processor P, knows only the argument Xi S X;. In 
each communication step some of the processors send messages to some other 
processors. Each message consists of bits. The communication terminates when 
all the parties know the value f(x%, X2, ■ ■ ■ , Xf.). The communication complexity 
of the function / is the minimum number of exchanged bits which are required 
to solve the problem, measured on a "worst" input. 

In this paper we add noise to the communication. We consider two kinds of 
noisy binary symmetric channels. In an probabilistic channel bits are incorrectly 
transmitted with a given probability e > 0. In an adversarial channel the 
fraction of incorrectly transmitted bits is at most e > 0. We assume that noise 
can cause bit inversion but bits are never lost and never any extra bit is added. 

'Supported by the European Community Research Training Network Games and Polish 
KBN grant No. 4 T11C 042 25. 
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Our aim is to construct a protocol which is proof against noise, computes 
function / and exchanges n = O(no) bits, where no denotes the communication 
complexity in the case when the processors are connected by noiseless chan- 
nels and the constants implicit in the O(-) notation depend only on e. For an 
adversarial channel the protocol should always return a good result. For a prob- 
abilistic channel it should fail with probability bounded by e~ ni - n "\ Rajagoplan 
and Schulman 0J|S] show that this is possible for a probabilistic channel. More- 
over Schulman concludes in 5 about an adversarial channel in the case of two 
processors. But the proofs in those papers are not constructive. 

We concentrate on protocols for an adversarial channel. It should be clear 
that the robustness against an adversarial channel is a stronger property than 
the robustness against a probabilistic channel. The following Lemma states this 
precisely. 

Lemma 1 If a protocol V exchanges n bits and is proof against any number of 
errors less than eon, then the protocol V fails with probability bounded by e~^ n ^ 
when running on a probabilistic channel with error probability e < eq. 

Proof Let X be a random variable denoting the number of inverted bits during a 
run of V on a probabilistic channel. Because transmission events are statistically 
independent therefore X has binomial distribution with probability e of the 
success in a single event. 

Running on a probabilistic channel, V fails with probability Vx(X > eon). 
From Tchebyshev exponential inequality we have that for any A > 

Pr(X > s n) < -j = [ . 

It is sufficient to show that /(A) = e Ae ° — ee x — 1 + e > for some A > 0. 
We have /(0) = and /'(0) = e Q - e > 0. Hence /(A) > for some A > 0. □ 

Our construction is based on a generalization of Ulam's game IB] . We show 
the equivalence between a winning strategy in this game and a robust proto- 
col. Ulam's game is widely considered in connection with one-direction data 
transmission over noisy channel and error correcting codes. However, its appli- 
cation requires the presence of noiseless feedback ^0], which is not realistic. 
We overcome the problem of noiseless feedback using a tree code introduced by 
Schulman [Jy. 

We hope that the ideas presented in this paper can bring the results of |S] 
closer to the edge of practical applicability. Our approach seems to be more 
practical because we use much shorter alphabet and we do not need back steps 
in protocol (like in @J|E]), which make it much simpler. 

2 Tree Code 

Let us remind definition of Schulman's tree code from |F] . 

Definition 1 A d-ary tree code over alphabet S, of distance parameter a and 
depth n, is a d-ary tree of depth n in which every arc of the tree is labeled 
with a character from the alphabet S subject to the following condition. Let u 
and v be any two nodes at some common depth h in the tree. Let h — I be 
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the depth of their least common ancestor. Let a\a 2 . . . ah and bib 2 • ■ ■ bh be the 
concatenation of the letters on the arcs leading from the root of the tree to u and 
v, respectively. Then H{ah-i+\CLh-i+2 ■ ■ ■ a hl b h ^i +1 b h -i +2 ■ ■ ■ b h ) > al, where 
the Hamming distance H counts the number of positions i in which et^ ^ bi . 

We focus on the binary case d = 2 with distance parameter a = 5. Schulman 
[Hj proves that an alphabet of size 95 suffices to construct such a code. The fol- 
lowing theorem shows that only 16 letters are sufficient and achieved Hamming 
distance is a little better than |, as shown on Figure^ Moreover our proof con- 
tains a randomized construction which in contrary to the proof from 5 almost 
surely gives a good code. A code over the alphabet S r — {0, 1, 2, . . . , 2 r — 1} is 
called an r-bit code. We can interpret S r as the set of all possible r-bit vectors. 

Theorem 1 For every n > 1 and every r > 4 there exists an r-bit binary tree 
code of depth n satisfying 

(I ifl<r, 

H(a h ^i +1 a h -i +2 ■ ■ ■ a h , b h -i+\bh-i+2 ■■ - b h )> < r ifr<l<2r, 

[l/2 ifl>2r. 




Figure 1: The Hamming distance lower bound for the r-bit binary tree code. 

Proof Let io,ti,t n -i be a sequence of letters from S r such that for i < r 
they are linearly independent bit-vectors, e.g. i; = 2 r ~ l — 1, and the remaining 
elements are chosen randomly with uniform distribution. We label arcs of the 
tree with alphabet S\ = {0, 1} so that arcs outgoing from each node to its sons 
have different labels. If ■ ■ ■ a' h is the concatenation of the labels from the 
root to a node u over alphabet S\ then as the label over alphabet S r for the arc 
leading to the node u from its parent we take 

h 

ah = (J)a^h-i, 

i=i 

where ® denotes bitwise addition modulo two (exclusive or). 

Let u and v be two different nodes at the same depth and w their least 
common ancestor. Let a\a 2 • • • a/ and 6162 ■ ■ ■ h be the concatenations of the 
letters over alphabet S r on the arcs leading from w to u and v respectively. 
The Hamming distance between labels leading from the root to u and v is the 
number of nonzero elements in the sequence C\, c%, . . . , cj, where Ci = © bt. If 
a\a'2 ■ ■ ■ a[ and b'^b' 2 ■ ■ ■ b\ are the concatenations of the letters over alphabet S\ 
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on the arcs leading from ratou and v respectively and d i — a' i © b[ then we have 



c[t a = Cl 

cU © c\t\ = c 2 

c'jto © c\_J\ © ... © ci*/_i = Cl 

Observe that c[ = 1. 

We can choose 2 rijl ~ r > different sequences t r ,t r+1 , . . . ,t n -±. We denote by 
F(n) the number of those among them which violate the code condition (Ham- 
ming distance). Let f{n,l) be the number of sequences which violate the code 
condition first time at distance I from the least common ancestor. Obviously 
we have 

n 

F(n) = J2f(n,l). 
1=1 

Because to, t\, . . . , £ r _i are linearly independent bit-vectors and c[ = 1 then 
c.i ^ for i < r. Hence f(n,l) = for I < 2r. Therefore thesis holds when 
n < 2r or I < 2r. Subsequently we assume that n > 2r and I > 2r. 

If / is even and the code condition is violated for some nodes at distance I 
then the number of zeros in the sequence c±, c<i, . . . , c\ is at least 1/2+ 1. Hence 
the number of zeros in the sequence ci, ca, . . . , c;_i is at least 1/2 and the code 
condition is violated already at distance I — 1. Therefore f(n,l) = when I is 
even. Putting / = 2m + 2r + 1 we get 

\n/2]-r-l 

F(n)= ^ f(n,2m + 2r+l). 

Now we count the number of bad sequences c r +i, c r+ 2, . . . , c/ which can cause 
code violation first time at distance I. We use a notion of walks on the xy-plane, 
see Figure |2 We assign a walk to each sequence c\, c 2 , C3, . . .. The value on the 
x-axis is distance I from the least common ancestor. The value on the y-axis 
is difference between doubled Hamming distance and I. We start at the point 
(0, 0). Then we go by the vector (1, 1) when succeeding c, is nonzero and by the 
vector (1,-1) otherwise. The walk corresponds to a bad sequence if it reaches 
the point (Z — 1,0) never previously touching the line y — — 1 and then it goes to 
the point (I, — 1). Because c; ^ for i < r then after the first r moves we are at 
the point (r, r). So the number of bad walks is the number of walks from (r, r) 
to (I — 1, 0) such that they never touch the line y — — 1. From the reflexion law 
this is the number of walks from (r, r) to (I — 1 , 0) minus the number of walks 
from (r, — r — 2) to (I ~ 1, 0). The number of walks from (x\, yi) to (X2, 2/2) is 

(x 2 -xi+y 2 - yi)/2 

if x-i — x\ + y 2 — yi is even and otherwise. Consequently there are 

/ 2m + r\ ( 2m + r \ / 2m + r\ f 2m + r 
\ m J \m + r + lj \ m J \ m — 1 
2m + r\ m ( 2m + r\ r + 1 / 2m + r 



m J m + r + 1 V m J m + r + lV m 
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bad walks. In each such walk, after the first r moves up, we go to times up and 
m + r + 1 times down. The moves up can be chosen on (2 r — l) m ways because 
we have 2 r — 1 nonzero choices for each Cj. The move down requires choosing 
as Ci. Hence we have 



B{m, r) = 



1 /2m + 

• + 1 \ TTl 



(2 r - iy 



bad sequences c r +i, c 



r+2, ■ 



-r - 2 



A 

ih- 






1 X !T^* 

r / \ i I x 







Figure 2: A walk on the a;y-plane. 
For every r > 1 there exists e > such that for any to > holds 



m + r + 1 2to + r 
It is sufficient to put 2 e = . Therefore we have 

B{m ,r)<^—( 2m + r )r^. 
2m + r \ to J 

From equations JQ) we conclude that for every bad choice of c r +i, c r +2, . . . , c; 
and for every choice of c' x , c' 2 , . . . , cj there is at most one bad sequence of letters 
t r , t r+ i, . . . , ti-i which violates the code condition first time at distance I. We 
have 2 i_1 choices of c^, c' 2 , . . . , c[ because c[ = 1. There are 2 r ( n— " sequences of 
f;, ij+i, . . . , tn-i to choose from. Therefore 

/(n, 2to + 2r + 1) < 2 i - 1+r ( n ^.B(m, r) = 2 r ^-^- r2+r+2< - 1 -^ m B{m, r) < 



< 



2m + r\ „ r („_ r )_ r 2 +r+1+£+ ( 2 _ r ) r 



2m + r V to 

and 



[n/2]-r-l 



F(n) < 2 r ("- r )- r2 + r + 1+e \^ r /2m + r\ 2 ( 2 _ r)m 

^— ' 2to + r \ to / 

m=0 v 7 
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From formulas (5.68), (5.70)] we have that 



E 



2m + r\ m _ (I - VI - 4z 



2m + r \ m I V 2z 

We put z = 2 2 - r . For r > 4 we have that < * < |, 1 - VI - 4z < 2z(l + 4z) 
and (1 + 4z) r = (1 + 2 4 ~ r ) r < 16. 

Replacing the finite sum with an infinite one we get 

F(n) < 2 r (™~ r )~ r2 + r + 5+e . 

Finally probability p, that randomly chosen sequence t r , t r +i, . . . , i n _i violates 
the code condition, does not exceed 2 -r (™ -r ) F '(n) . Hence 



P < 2 



-r^+r+5+e _ r+l -r z +r+5 



which for r > 4 is less than 0.01. □ 
The above proof implies that in practice we can construct a good tree code 
with probability as close to 1 as we need. There are two possibilities. Either 
we choose appropriate r (already for r = 8 we have p < 9 • 2 -54 < 5 • 10~ 16 ) or 
we use computer to find the first q letters to,ti, . . . , which maximize the 
Hamming distance. Then probability p decreases exponentially with q. 

In next sections we use the 4-bit binary tree code. We use the following 
property of such code. Assume we are successively receiving letters 010203 . . . 
describing a path from the root of the tree, and after each letter we are trying to 
identify a node to which this path leads. Then, if the total number of erroneously 
received letters is at most r, the Hamming distance property of the tree code 
causes that we badly identify the node at most 4r times. 

It should be mentioned that the factor 4 is rough. In fact the code described 
in above proof can have better property. 



3 Generalized Ulam's Game 

Let us remind the definition of Ulam's game. There are two players Carole and 
Paul. The names come from Spencer 0. Carole thinks of a number x between 
1 and n. Paul asks at most q questions of the form "does x € F?" , where 
Fc{l,2,...,n}. Carole answers "yes" or "no" and she is permitted to lie at 
most / times. Paul wins if at the end of the game he knows x. 

We propose a generalization of Ulam's game. Let /: X\ x X% x . . . x Xk — > Y 
be a given function. Carole thinks of a tuple (x%, X2, ■ ■ • ,Xk) € ^ixX 2 x . . .xXfc. 
Paul asks at most q questions. In each question he chooses i S {1, 2, . . . , k} and 
he asks "does Xi g Fj?" , where Fi C Xi. Like in the original game Carole 
answers "yes" or "no" and she is permitted to lie at most I times. Paul wins if 
at the end of the game there is a unique possible value of the function /. 

We allow Carole to play an adversary strategy. Carole does not actually pick 
a tuple but answers all questions so that there is always at least one tuple that 
satisfies all answers, except for at most I of them. We denote the generalized 
game by Q (f,q,l). This game is determined. For given f,q,l either Paul or 
Carole has a winning strategy. 

A winning strategy for Paul is represented by a tree. 
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Definition 2 The strategy tree is a labeled binary tree. Each internal node is 
labeled with an index i and a set Fi . Arcs leading from each internal node to its 
sons are labeled with or 1. Leafs are labeled with values of the function f . 

The interpretation is as follows. Paul starts at the root. In each internal 
node he asks the question "does Xi £ Fi?" found in the label of that node. If 
the answer is "yes" then he goes along the arc with label 1 and if the answer 
is "no" then he goes along the arc with label 0. If he reaches a leaf then he 
reads the value of /. The depth of the strategy tree is the pessimistic number 
of questions which have to be asked to win. 

As we see in section ^. II we can construct a winning strategy for Paul in the 
game Q (/, no, 0). However usually it should be possible to construct a winning 
strategy in the game Q (/, \no/(k— 1)],0) because intuitively answering one 
question conforms to sending k — 1 bits. 

For standard Ulam's game, if Carole is required to make sure that the frac- 
tion of lies is less than 1 /3 of her answers then Paul has a winning strategy with 
Q{q) questions, where q = |~log 2 n] is the number of questions which allows Paul 
to win in the game without lies [Jj. We expect that for the generalized game a 
similar result holds. 

Conjecture 1 There exists a constant e g > such that for every function f 
for which Paul has a winning strategy in the game Q (/, n, 0), if the number of 
lies I is less than the fraction e g of all questions q (I < s g q), then Paul has 
a winning strategy in the game Q (f,q,l). Moreover q < C g n, where C g is a 
constant which depends only on l/q. 

Theorem 4 from and considerations in section Hill imply that above con- 
jecture holds for two-argument functions and with e g = 1/240. However we 
cannot construct a strategy because of a non-constructive proof in |S] . To make 
a practical benefit from above conjecture a constructive proof is needed. 

Our main result can be stated not so formally as the following theorem. 

Theorem 2 There exists a lie-proof winning strategy for Paul in generalized 
Ulam's game with the function f if and only if there exists an error-proof protocol 
for k processors computing the function f . 

Next subsections contain the appropriate constructions which explain the 
details. 

3.1 Converting Protocol into Winning Strategy 

Assume we have a protocol for k processors computing function /: X\ x X^ x 
... x Xk — > Y. If some transmissions occur at the same time then we order them 
first by the number of the sender and next by the number of the receiver. So 
we have a protocol in which at every time-step exactly one processor Pi sends 
one bit to the another processor Pj . This bit depends on the input Xi € Xi and 
on bits which were seen by Pi before. 

Paul simulates the run of the protocol. Assume that he knows all bits re- 
ceived by all processors up to the present. So Paul knows the processor Pj which 
should send a bit at present step. Hence he can compute the subset Fi of the 
set Xi such that Xi € Fi if and only if P, sends 1 at the present step. Paul 
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asks "does Xi G Fj?". Answer "yes" means that processor Pj receives 1 and 
answer "no" means that processor Pj receives 0. If Carole tells the truth then 
this corresponds to a proper transmission of the bit. If Carole lies then this 
corresponds to the inversion of the bit during transmission. Hence Paul knows 
bit received at present step. By induction on the step number Paul knows all 
bits received by all processors in the run of the protocol. This sequence of bits 
is called the protocol trace. 

Because at the end of the protocol all processors know the same value of 
the function / then the protocol trace determines this value and Paul knows it, 
as well. Therefore if the protocol exchanges q bits and is proof against / errors 
then we have winning strategy for Paul in the game Q (/, q, I). 

3.2 Converting Winning Strategy into Protocol 

Assume we have a winning strategy for Paul in the game Q (/, q, I). We call it 
shortly a strategy. The state of the game is the location in the strategy tree. 
We identify the state with the tree node. If v is a state then by va we denote 
the son of v to which leads the arc labeled with a, where a G {0, 1}. Let for a 
node v the symbols i(v) and F(v) stand for the index and set used by Paul in 
his question, respectively. I.e., in node v Paul asks "does x^ G F(v)V : . Let 
[ip] be 1 if the condition <p is true and otherwise. 

The strategy corresponds to the following protocol. All processors maintain 
the same state. The initial state is the root. In the state v processor P^ v ) 
broadcasts bit a = [x^ G F(v)] to all other processors. The transmitted bit 
can be inverted, but all the processors receive the same value a. Moreover we 
have to assume that there is a feedback to the sender and it knows what bit has 
been received by others. Next all the processors change their state according to 
the received bit, to the new state va. If processors reach a leaf then the protocol 
ends. It is clear that if the number of faulty broadcasts does not exceed the 
number of lies for which the strategy was developed then all the processors 
properly compute the value of the function. 

The above scenario is not realistic and docs not correspond precisely to 
our communication model. We show how to convert this protocol into another 
protocol in which we need neither broadcast nor error-free feedback. 

The first problem we have to overcome is the possibility of protocol deadlock. 
This happens if i(v0) ^ i(vl) and processors Pj(„o) an d Pi( v i) receive 1 and 
respectively. In this situation all processors will wait for a message forever. We 
consider semi-static strategy in which all indices at same depth in the strategy 
tree are the same. Every strategy of depth q can be easily converted into a 
semi-static one, of depth q' < kq. Moreover we can assume that all leafs in the 
new strategy tree are at the same depth. In other cases we can easily extend 
the strategy tree so that it has this property. 

We label the strategy tree with a tree code using the alphabet S4. Let S(v) 
denote the letter on the arc leading to v from its parent. 

Now we have the following protocol. Processor Pj maintains the state Vj. 
Processor Pi maintains its own state. Other processors maintain expected state 
of processor Pi. At beginning all states are the root. 

Protocol consists of rounds. In each round all states are at the same depth in 
the strategy tree. Because we consider semi-static strategy then in each round 
all processors know the same value i = i(vj). In each round processor Pj sends 
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a = [xi E F(vi)] to Pi. Say that Pi receives a. If i = 1 then Pi does not need 
to send anything to itself and we assume that a = a. Next processor Pi changes 
its state to via and sends s — S(via) to all other processors. Transmission 
can be corrupted and processors can receive different bits. Say that for j ^ 1 
processor Pj receives §j. Next Pj updates the state Vj using the tree code 
property. Protocol ends when processors reach a leaf. 

Let qi and qi be the number of messages received and sent by processor 
Pi , respectively. Let n and ri be the number of erroneous messages received 
and sent by processor Pi, respectively. The protocol works fine if T\ + 4r2 < I. 
Hence it is proof against at least 1/4 bit-errors. The protocol exchanges q% + 4^2 
bits. We have qi < q' < kq and (72 = (k — l)q' < k(k — l)q. Hence the protocol 
exchanges at most fc(4fc — 2>)q bits. Therefore it is proof against at least the 
fraction 

4fc(4fc - 3)q ^ 

of badly transmitted bits. 

We can increase robustness of the above protocol by running it parallel on 
all processors. Now each processor Pj maintains k states Vji, Uja, • • • , Vjk- The 
state Vjj is own state of the processor Pj . For m 7^ j the state Vj m is the state 
of the processor P m predicated by the processor Pj . 

Protocol consists of rounds. In each round all states are at the same depth 
in the strategy tree. Because we consider a semi-static strategy then in each 
round all processors knows the same value i = i(vj m ). In each round for each 
j ^ i the processor Pi sends a,j = [xt € F(vij)] to the processor Pj. Say that 
Pj receives a,j. If j = i then Pj does not need to send anything to itself and 
we assume that a,i = a^. Next for each j processor Pj changes state to VjjCij 
and sends Sj = S(vjjd,j) to all other processors. Say that for j ^ m processor 
Pj decodes letter from processor P m as Sj m . Then Pj updates the state Vj m 
using the tree code property. Protocol ends when processors reach a leaf. Each 
processor has k values of the function. It is sufficient that at least [k/2\ + 1 of 
them are correct. 



4 Conclusions 

Presented calculations are very rough because our principal aim is to show that 
we can construct a robust protocol from a strategy in the game and we want 
to show this as simple as possible. In fact a single bit error by transmitting a 
letter from the 4-bit tree code may not cause an error by decoding this letter. 
Probably in practice a protocol constructed from a good strategy have better 
properties than expressed by ©■ 

Further work should concentrate on proving Conjecture ^ Constructive 
proof which already gives semi-static strategy is desired. Such a proof would 
give another proof of Schulman's Coding Theorems [H]- We hope that this 
approach should give less communication overhead. 

Another direction of research is searching for an efficient deterministic con- 
struction of a tree code and development of an efficient algorithm for tree code 
decoding. A "efficient" means working in time linear in the depth of the node 
in the tree. Some advice how linear time decoding could be done is given in |2j . 
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